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Metbod and Appa.rat;ix8 £or doBcribitig sound sourcaa 

The invention relates to a method and to an apparatus for 
5 describing ^ound sourcesa, especially for sound soiarces 

encoded as audio objects according to the MPEG- 4 Audio stan- 
dard. 

10 Backerround 

The MPEG-4 Audio standard a$ defined in ISO/IEC 14495-3 and 
14496-1 facilitates a wide variety of applications by sup- 
porting the representation of audio objects. For the combi- 
xs nation of the audio objects additional information - the so- 
called scene description - determines the plac^ent in space 
and time and is transmitted together with the coded audio 
objects . 

For playback the audio objects are decoded separately and 
20 coiftposed using the scene description in order to prepare a 
single soundtrack, which is then played to the listener . 

For efficiency, the MPEG-4 Systeias standard ISO/XEC 14496- 
-Idef ines a way to encode the scene description in a binary 
25 representation, the so-called Binary Format for Scene De- 
scription (BIFS) - Correspondingly, audio scenes are de- 
scribed u^ing so-called AudioBIFS. 

A scene description is structured hierarchically aad can be 
30 represented as a graph, wherein leaf -nodes of the graph form 
the separate objects and the other nodes describes the proc- 
essing, e.g. positioning, scaling, effects etc>. The appear- 
ance and behavior of the separate objects can be controlled 
using parameters within the scene description nodes - 

35 
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Tlae invexifcioa is bas^d on t:h.e reeognition o£ the following 
fact. Currently tlie MPEG-4 Audio standeird cannot describe 

5 sound sources ttiat have a certain dimension, like a choir, 
orchestra, sea or rain but only a point source/ e.g. a fly- 
ing insect/ or a single instrument- According to listening 
tests wideness of sound sources is clearly audible, whereby 
more complicate descriptions like the shape of the audio 

10 object is not necessary. 

•Therefore, a problem to be solved by the invention is to al- 
low the description of the wideness of sound sources that 
have a certain dimension in a simple and l>aclcwards 
IS compatible way. 

This problem is solved by the method disclosed in claim 1 
and the corresponding apparatus in claim 4. 

20 xn principle, the inventive method allows to describe sound 
sources/ which are encoded as separate audio objects* The 
arrangement of the sound sources in a sound scene is de- 
scribed by a scene description. For playback the audio ob- 
jects are decoded separately and a single soundtrack is com- 

2S posed from the decoded audio objects using said scene de- 
scription. For describing the wideness of a sound source an 
audio spatial diffuseness node is defined within the scene 
description « 

30 Advantageous additional embodiments of the invention are 
disclosed in the respective dependent claims* 

Drawings 

35 

Exemplary embodiments of tlae invention are described with 
reference to the accompanying drawings, wbich show in 
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Pig, 1 the illustration of tlpie functionality of the 

AudioSpatialDiffuseness mode; 

Pig. 2 ^ Audio Scene for a liine Soxmd Source©. 



Exemplary embodiments 

10 Figure 1 shows an illustration of tlie functionality of tlxe 
inventive AudioSpatialDif fuseness node. 

This AudioSpatialDiffuseness node will have a children field 
as input and will produce the same number of channels (nuan- 
15 Chan) as output- Branches that are connected to an upper 
level branch are called children in terms, it can be 

inserted in each branch of the audio subtree^ without chang- 
ing any other node- 

20 A dif fuseSelection field will allow the scene author to con- 
trol the diffuseness algorithms/ so that each AudioSpa- 
tialDif fuseness node will produce a different output. Xn 
practice a diffuseness node will virtual produce N different 
signals/ but only one real signal is passed through to the 

25 output of the node, signaled by the dif fuseSelect field. 
Other fields like a decorrelation strength (deoorrStrength) etc. 
could be added to the node, if required. 
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AudioSpatialDiffUiseness { 

eventin ifflPNode addChildren 
event in MFNode r^oveChlldrea 
exposedPield MFNode children t 1 

ejqposedField SFInt32 dx££usaSelec1: 1 
exposedField SPlnt32 deeorrestrei^ath. 1 
field SPIrxt32 numChau 1 

field MPInt32 pliaseGroup C J 

} 



Teible Is Seiaatxtics of the proposed AudioSpatlalDif fueeness 
Node 

15 In the case of numChatx greater than one each channel should 
be diffused separately. 

Figure 2 depicts an Audio Scene for a Line Sound Source. By 
using this proposal the scene author has to decide how many 

20 and at which position the decorrelated multiple point sound 
sources will be located. The advantage is, that the content 
author has much more control over the shape effect. He can 
also use intensity and direction of each point source as 
well as using the AudioDelay and AudioEffects node for cer- 

25 tain Sound nodes to manipulate the effect - 

It is still possible for the renderer to reduce the computa- 
tional power by passing the scene tree to look for identical 
AudioSources . 



# Example of a line sound source replaced by three point 
sources 

# using one single decoder output. 

Group C 

children C 

DBF POSl Sound { 
intensity 0,9 
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location 0 0 0 
spatiialis;© TRUE 

source AadioSpabi.alDi££usenes3 

riLuaiCliarL 1 

dif fus^Select 1 



{ 



cliildren [ 

DBF BEACH AudioSource 



{ 



nuniChan 1 
url 100 



} 



] 



} 



DEF BOS2 Sound { 
intensity 0.8 
location -3 0 0 
spatial ize TRtJE 

source AadioSpat±alDie£useness 

nuxaChan 1 

dif fuseSelecfc 2 

children [ USE BEACH] 



DBF POS3 Sound L 
intensitiy 0.8 
location 3 0 0 
spatialise TRUE 

source AudioSpa'ta.a.lDl££usene8a ( 

numClxan 1 
diffuses el act 3 



ciiildren [ USE BEACH] 



} 



] 



) 



35 



Talile 2 s Example of a Line Sound Source replaced by 

tliree Point Sourceis using one single Audio- 
Source . 
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CXaims 



1, Method for describixig sound sources, which, are encoded as 
separate a.u.dio objects, wherein the arraingement of the 
5 sound sources in a sound scene is described by a scene 

description, and wherein for playbaclc the audio objects 
are decoded separately and a single soundtrack is com- 
posed from the decoded audio objects using said scene de- 
scription r cbaracterlzod by an audio diffuseness node 
10 which is defined within the scene description for 

describing the widenoss of a sound source . 



2- Method according to claim 1/ wherein a diffuse selection 
field will allow the scene author to control the 
15 dif fusenes® algorithnis , 



3 • Method according to claim. 1 or 2 , wherein a decorrelation 
strength field will allow author to control the strenght 
of the decorrelation. 

20 

4. Apparatus for performing a method according to any of 
claims 1 to 3 « 
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Abstract: 

The MPEG-4 Audio standard as defined in ISO/IEC 1449S-1 and 
-3 facilitates a wide variety of applications by supporting 
5 tlie representation of audio objects. For tlie combination of 
the audio objects additional information - the so-called 
scene description - determines the placement in space and 
time and is transmitted together with the coded audio ob- 
jects • 

10 For playback the audio objects are decoded separately and 
composed using the scene description in order to prepare a 
single soundtrack, which is then played to the listener. A 
scene description is structured hierarchically and can be 
represented as a graph, wherein nodes of the graph form the 

15 separate objects- The appearance and behaviour of the sepa- 
rate objects can be controlled using parameters within the 
scene description nodes. For describing the wideness of a 
sound source an audio diffuseness node is defined within 
the scene description 

20 

Pig. 1 
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